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Abstract 

Poisson distributed shot noise is normally considered in the Gaussian limit in 
cosmology. However, if the shot noise is large enough and the correlation func- 
tion/power spectrum conspires, the Gaussian approximation mis-estimates the er¬ 
rors and their covariance significantly. The power spectrum, even for initially Gaus¬ 
sian densities, acquires cross correlations which can be large, while the change in 
the correlation function error matrix is diagonal except at zero separation. Two and 
three dimensional power law correlation function and power spectrum examples 
are given. These corrections appear to have a large effect when applied to galaxy 
clusters, e.g. for SZ selected galaxy clusters in 2 dimensions. This can increase the 
error estimates for cosmological parameter estimation and consequently affect sur¬ 
vey strategies, as the corrections are minimized for surveys which are deep and 
narrow rather than wide and shallow. In addition, a rewriting of the error matrix 
for the power spectrum/correlation function is given which eliminates most of the 
Bessel function dependence (in two dimensions) and all of it (in three dimensions), 
which makes the calculation of the error matrix more tractable. This applies even 
when the shot noise is in the (usual) Gaussian limit. 


In cosmology, continuous density fields are used to describe distributions of 
discrete objects such as galaxy clusters. The actual discrete number counts 
are taken to be a sampling of this continuous distribution resulting in the 
appearance of shot noise. Usually the shot noise is taken to have a Gaussian 
distribution, considered as the large number approximation to an underlying 
Poisson distribution (Layzer ( 19561) . see the textbook by Peebles (jl980h for a 
review). The interest here is when the Gaussian approximation for the shot 
noise breaks down. This paper considers in detail the specific case where the 
shot noise distribution is in fact Poisson PI It is seen that the Poisson nature 
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2 It has been shown that Poisson sampling is accurate for dark matter halos in 
numerical simulations in regions where the density is not too high (Casas-Miranda 
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of the shot noise can produce significant modifications of the Gaussian shot 
noise error matrix of the correlation function and power spectrum. 


The full Poi sson power spectrum error matrix was calculated by Meiksin & 
White f 1999f l. In this paper it is angle averaged, compared to usual expressions, 
and applied to 2 and 3 dimensional power law examples comparable to SZ 
selected galaxy cluster surveys. The effect on the explicit two dimensional SZ 
selected galaxy cluster power spectrum is also found and is seen to be large 
for some realistic survey parameters. These increases to the error matrix have 
consequences both for cosmological parameter estimates and for designing 
surveys (the additions only appear if the shot noise is big enough, which can 
be alleviated by going deeper for an SZ selected survey). 


Section one has de finitio ns and the two dimensional angle averaging of the 
Meiksin & White 019991 ) result. Three dimensions are considered in section 
two, and section three discusses a rewriting of the error matrix which mini¬ 
mizes the appearance of Bessel functions and their instabilities. This rewriting 
is also useful for the Gaussian shot noise case. In section four, two and three 
dimensional power law examples are given, as well as an application to a two 
dimensional SZ selected galaxy cluster power spectrum. Section five concludes. 


The rarity of galaxy clusters and the possible_need for Poisson shot noise has 
been recognized before, e.g. in Lima & Hu 020041 1 the full Poisson probability 
function is used to obtain constraints from cluster number counts in conjunc¬ 
tion with their scatter. 


1 The effect: two dimensions 


One can characterize a two dimensional density distribution via its correlation 
function 

(h(r)h(r + 0)) (1) 

where 6 refers to the two dimensional angular separation (6, (p) and the hat 
denotes an operator. There are actually two averages in the above expression 
for discrete objects, as the density distribution is treated as a continuous field 
sampled from the underlying discrete distribution, here the sampling is taken 
to be Poisson. The first average is over this Poisson sampling, and the second 
is a sample average, usually assumed to be the same as volume average. The 


et al (2002j)); the transition density depends upon the dark halo mass of interest. 
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Poisson and sampling average of two density perturbations give£i| 


9 )} = w(0) + ( 2 ) 

where 5^(0) is a two-dimensional Dirac delta function, and A f is the number 
density of objects. The angular correlation function w(8) is defined as the 
probability above random for finding two objects with one in area Di and the 
other in area f2 2 

P 12 = Af 2 (l + w(6))dn id0 2 (3) 


The operator w is usually defined with the shot noise subtracted out: 


w[Q ) = 8(f)8(r + 6) 


5 d 0 ) 

M 


( 4 ) 


so that (w(8)) = w(6). 

In practice the measured correlation function will also depend on window 
functions of the survey W(9), 

w(9) = j±f d 2 8' J d 2 9"W{8')W{8”)8{8')8{8")5 D {8' - 8" - 9) - ^ 

A{9) = f d 2 8 l d 2 9"W(8')W(9")8 D (9' - 8" - 8) . 


In the work here the window function is taken to be one inside the region of the 
survey and zero elsewhere. This simple case illustrates the effect of interest 
for this paper, however in practice more complicated window functions do 
arise and can introduce additional subtleties. For more detailed discussion of 


window functions and errors see Eisenstein & Zaldarriaga (J2001). 


There is also the Fourier transform of void), the power spectrum, 


P 2 (K) = / d 2 9e- lK - ti w{9) 


( 6 ) 


here K = (Ji, </>*,). The operator P- 2 (K) also has the shot noise subtracted out 
(in this case a constant). The operators 8(8), w{8) are dimensionless, while 
Po(K) is in terms of steradians. 

As isotropy is usually assumed in K and 9 space (this can mean neglecting 


3 For more detailed discussion in Fourier space for the conventions used here see, 
e.g., (Il999h . 
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boundary effects in particular), both w and P 2 are usually angle averaged: 


PAK) = ± !d<t> K Pi(K) 


The error matrix of the correlation function and power spectrum can be cal¬ 
culated by going back to the definitions of w{9) and P(K ) in terms of the 
density. Meiksin & White ( 1999 ) calculated the Poisson contribution to the 
error rnatri?P~l in the power spectrum using counts in cells (Peebles, ( 198dh 
§36). For Gaussian densities S the power spectrum error matrix is 


(P 2 (k)P 2 (k')) - (P 2 (k))(P 2 (k')) = ( p 2 (k) + + <$*_*,) 

+^(P 2 (\K + K>\) + P 2 (\K - K’\) + 2 P 2 (K) + 2 P 2 (K')) 
4_L_ 

(8) 

Here (and later in the text) 5 a b refers to a Kronecker 6 function. If the densities 
themselves are non-Gaussian, there are additional terms depending upon the 
three and four point functions, 

h T (K, -K, K\ -K') + -K, 0) + B( 0, K 1 , -K’) 

+B(K + AT -K, -IP) + B(K - K\ -K, IP) + B(K, IP - I<. -IP) + B(I<, -K - IP , IP)) 

( 9 ) 

which are Fourier transforms of the spatial four and three point functions rj 
and ( respectively. These additional terms can be angle averaged (and Fourier 
transformed) straightforwardly and thus are not shown in the following. They 
are needed however if the density distribution is non-Gaussian. 

There is also an additional term ^tI^P 2(0), which may be nonvanishing for 
some choices of window function. As it is a constant it just modifies 

+ ■ a /’p 2 (°)) (10) 

in the above and in its averages/limits below. The same multiplicative factor 
(1 + J\fP 2 (0)) (if different from one) occurs in the the last term in the error 
matrix for the two dimensional correlation function. For three dimensions the 
last term is instead modified by (1 + A/"P(0)). It will not be displayed in the 
following but can be added in directly if non-zeroU] 


1 The two dimensional version is shown below, i.e. take V —> H in their result. 
5 I thank the anonymous referee for making this point. 
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To convert this expression to errors and covariances for the angle averaged 
correlation function and power spectrum, first take the continuum limit. As 


E 


Q 


(2tt) 


d 2 K and E E k> = 1> 


( 11 ) 


one has the correspondence 


, (2?r) 


-5 d (K - K') 


( 12 ) 


and similarly for 8g _g,. Thus the two dimensional power spectrum error ma¬ 
trix is 


(P 2 (K)P 2 (K')) - (P 2 (K))(P 2 (K')) = (P 2 (K) + - /?') + 6„(K + A")] 

+ s 4 s [2P 2 (A) + 2P 2 (K') + P 2 (|A + K'\) + P 2 (\K 




(13) 

The Fourier transform of this is the variance of the correlation function w(6): 


(w(6)w{0')) - (w(9))(w(9')) = ±fd 2 9"(w(9~ 9' - 9") + w{9 + 9'- 9"))w0") 

+^[ 2 w0 + 9’) + 2 w0- 9’)\ 

+^[(S d (9- 9') + S D 0 + 0>))(1 + ™0)) 

+2 w0)S D 0) + 2 w0)S D 0)] 

+ ^ 5 d 0 ) 5 d 0 ) 

(14) 

These correlation functions and power spectra are not angle averaged, i.e., they 
depend upon 9 and K rather than 9 , K . To get the more familiar functions of 
only 9 , 9' and K, K' requires an average over </>, <// and by, respectively. (In 
practi ce this means ignoring boundary effects, see Eisenstein and Zaldariagga 
( 20011 ) for discussion.) The correlation function and power spectrum are con¬ 
sidered in turn below. The error matrix for each is angle averaged and either 
binned or discretized. Then each is compared with previous Gaussian expres¬ 
sions in the literature to highlight when and how the additional Poisson-only 
(or “non-Gaussian”) terms are significant. 

For the angular correlation function, angle averaging requires the quantities 

1 f J2S ,n 1 riMo-vp , 2 2 ,N 1 &(«-#') , ir , 

-y = -y#— - e — 5 d (</> -<)>) = — — - — (is) 












and 


j dcj)S D (9 + 9') = dcf) 5 ^ 6 y e ^ 8 D {(t) ~(p' ±n) 


1 5 d {9-9') 
2tt 9 


(16) 


and likewise 

_L/^ D(s - ) = _i_ foW (it) 

These relations allow one to get the angle average of all the terms except those 
containing two w’ s. For those, go to Fourier space to get 


( 2 ^ I dm’ I d 2 9"w(9 - 9' - 9")w(9") = ^ j d(j)d fi j 0 e iK (e-e')p 2 {K) 

= f ^J 0 (K9)J 0 (K9')P 2 (K) 


pJjT I dm' f d 2 9"w{9 + 9' - 9")w(9") = f ^J 0 (K9)J 0 (K9')P 2 (K) 


$ dm'w (9 + 9') = f dm'w (9-9’) 

= I ^J 0 (K9)J 0 (K9’)P 2 (K) 

(18) 


Putting it all together gives 


(mm'))- (amwm 

= h^UmMK9')[P 2 (K) + %P 2 {K)\ + 


M T nM 2 

[2 + 2w(0)^n + 2«>(0')%f ] 


2ir9 


' fW 2 

+ SW 5 ( 27 r 0 )( 2 n 9 f ) ^ D (^)^D (9 ) 


(19) 


Often the error in w is given in terms of n p , the number of pairs. To connect 
with this expression, binning is needed, i.e. one observes not at just one angle 
but in a shell of width <50. This means w(9) is not measured at a point but 
smeared outPH over a region of size A 9: 


f A A9% m + A 9')(9 + A9')d(A9') _ 


w(9) 


+ M')d(M') 


= WA9(9) ■ 


( 20 ) 


6 I thank R. Sheth for correcting an error in this, in the following expression, and 
equation 26. 
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In practice this means that in equation [T9] one must replace 


um - j% 2 Mm+ m'w+ w)d{M) 

= Kko(M)MKe + ) - {q_)mkq_)) 

= JoAe(m 

e ± = e± ao /2 

as well as the explicit functions of w(6),w(6 f ) on the right hand side. The 
continuous 5d{0—O') functions are effectively replaced by Sg^/AO. Recognizing 
2tt0A0 = 59 gives 


{w A e(0)wAo(0')) 


(wAe(0))(w A e(0'))) 


= 1 1 ^j aA ,(mj«Mm\pKK) + £p*(fo] + 



( 22 ) 


This can be rewritten in terms of the number of pairs, n p = |A/" 2 599, 


(w A e(0)w A e(0')) - {w A e(0)) (w A e(0')) 

= h / ^Jo,Ao(mJoAe(KO')[pm) + + i v 


fij 2 t r 

+^ _ [^Ae»(^)<5e i 0' + w A e(0)Sg>fij% + w A e{0')&o,o[ 


1 °e,e' x 
2n v ATSn° 9 ’ 0 


(23) 

and 9 = 4 nf s k y . This shows the often used 1 /'n l J 2 shot noise error estimate in 
the context of the full correlation function error matrix. 


The Poisson rather than Gaussian contributions to the error matrix are the 
terms on the last two lines above. As these terms are proportional to 1 jn v 
they will decrease more quickly with increased binning than the terms involv¬ 
ing Bessel functions, i.e. the fractional contribution of the Poisson terms will 
decrease as the binning increases (assuming the binned w doesn’t increase). 
The main new contribution is the term proportional to 8g : g'W A (0)/n p which is 
significant only when w(0) is ’’large” (relative to one and for large shot noise), 
i.e. it is most important for small 0. The other terms seem less significant as 
they only arise when either 0 or O' are zero. 


The terms analogous to the (1 + w)/n p were found for the angular corr elatio n 
function calculated via the La nd y-Szalay estimator (Landy & Szalay (119931 1) 
and extended by Bernstein ( 19941 1 to the case where the correlation function 


7 













is large enough that the terms depending upon the power are also important. 
For the power spectrum, the angle average gives 


(P 2 {k)P 2 (k')) - (P 2 (k))(P 2 {k')) 


= f + ±) 2 ^{5 d (K - K') + 5 d (K + K')) 

+^[ 2 P 2 (K) + 2 P 2 {K') + P 2 (\K + K '|) + P 2 (\K - K'\)] 


= %wH K ~ K ')^ K ) + f ) 2 

+j^[2P 2 (K) + 2P 2 (K') + 4nf0d9 J o (K0)MK'9)w(0)\ 

i 1 

jv 3 n 

(24) 

The top line, diagonal in K , K 1 ', is the usual contribution when the shot noise 
is Gaussian, rather than Poisson. Here to connect with other formulae in the 
literature one can make K discretfP~l. in particular &n{K — K') —> $k,k/ AK = 

5k,k'- Then, replacing K —> £,5r,k' = S^/',P 2 (K) —> Ci and G = 47 t f s k y , we 
can recognize the standard expression for large £ 

(5C t SC e ) = kr {2t+ 2 1)f JQ + pf ~ (25) 

The completely binned expression is found by replacing PpK) by 

1 rAK/2 _ 

P2<(K) —— / d(AK)(K + AA P 2 (A + AJi) 26 

AAA J-AK/2 

and likewise for P 2 {K r ) on the left hand side of eqn. [24] and averaging the 
functions of K, K' on the right hand side analogously. 

The Poisson terms are non-diagonal and introduce cross correlations in the 
power spectrum. The cross correlation grows with the separation K — K re f up 
to some limiting value, ignoring binning one has 

(P 2 (K„,)Pi(K)) , r3n 2P 3 {K rel )/M" 2 + 1/jV 3 

/- *K\ arge J v , - , 

j(P2(K~f)P2{K r ' f ))mK)p 2 iK)) 

(27) 

This additional covariance makes the power spectrum errors correlated even 
if the density distribution was originally Gaussian. The size of this cross cor¬ 
relation is shown in examples in section four, it can be large. 


' The discretization can be done by binning, i.e. integrating from k — 1/2 to k- 1-1/2 
for each value of k. Assuming 1, below are shown only the first terms in 1/k. 















The errors have both Gaussian and non-Gaussian contributions, the non- 
Gaussian terms become comparable to the Gaussian terms when 2nJ\[ 2 P 2 (K)/K < 
1 and P 2 (K)K > 2i r, when there is no binning. In contrast to the correlation 
function case, increasing the binning increases the importance of the addi¬ 
tional non-Gaussian terms as binning roughly divides the first (i.e. Gaussian) 
term by the bin size relative to the others. In section four it is shown that for 
A/", P 2 (K ) and K of interest for SZ selected galaxy cluster surveys the Poisson 
contribution to the errors can be relatively large. 


2 Three dimensions 


In three dimensions the correlation function £(r) and power spectrum P(k) 
are related via 


P(k) = J d 3 re l *' r £(r) 

«f) = ^ I d 3 ke iif P(k) 


(28) 


To angle average these we again need to assume isotropy and that boundary 
effects aren’t important. In three dimensions, J\f denotes the number density 
per (/r _1 Mpc) 3 . The three dimensional continuum limit of the Meiksin & White 
( 1999 ) error matrix calculation is 


(P(K)P(K')) - (P(K))(P(K')) = (P(K)P(K')) - P(K)P(K‘) 

= P(K) + jr?(MK - A") + S D (lt + K')) 

+7Pv\. P (\K + K'\) + P(\K - K ’|) + 2 P(g) + 2P(K')} 


Fourier transforming to get the correlation function, angle averaging via 


1 

47T 




(30) 


and using 

s D (r- f) = SD(r ~ r 'h 0 (Si - a') 


(31) 
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gives 


(&)&')) - (Hr))(U r')) = 

2 


vk- S k*dk(‘-^)(£f£)P(k) + j^fc(r - r') 
+^[^ 1 ^( 1 “ - r')^(r) + 2{(r) = 4j<5 D (r') + 2£(r') jV.i5 D (r)] 

+WJ^M’');kMd(<)} 

(32) 


The additional non-Gaussian terms are the same as in the two dimensional 
case, i.e. diagonal and proportional to £( r)/n v when binning is included pins 
contributions if either r or r' are zero. In this case n p = J\f 2 VAnr 2 Ar /2. 


For the power spectrum, the angle average gives 


((P(K) - P(K))(P(K') - P(K'))) = $y(P(K) + j;)H D (K - K') 

+w( p ( A ') + P(K') + 4-k J r 2 dr ^ “^'^ (r)) 

l 1 

' J\f 3 V 

(33) 

The error matrix for the power spectrum also has the same structure as in two 
dimensions: the Poisson terms introduce correlations between different values 
of K, and analogously, the non-Gaussian nature of the shot noise will start to 
become important for the errors if < 1 and \K\ 2 P(K) > 47T 2 . With 

binning the first (Gaussian) term gets a factor of one over the size of the bin, 
in three dimensions this coefficient is likely to be less than one. 


3 Errors without Bessel functions 


The error matrices for the power spectrum and the correlation function above 
each depend on both the correlation function and the power spectrum. This is 
less than ideal because going between power spectra and correlation functions 
involves integrals over Bessel functions Jo hi two dimensions and spherical 
Bessel functions jo{x) = sin(a:)/x in three dimensions. Integrals against these 
Bessel functions do not converge quickly; with real data and thus perhaps 
incomplete coverage in real or momentum space the results may not be very 
accurately] If one wished to sidestep this problem by using the measured cor¬ 
relation functions and power spectra together to calculate the errors for either, 


1 There are fast ways of doing integrals against one Bessel function which are 
described at http://casa.colorado.edu/~ajsh/FFTLog/ . 
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one would be combining quantities which have very different estimators and 
thus systematics. 

However, as the correlation function and power spectrum are Fourier trans¬ 
forms of each other, one can instead substitute the definitions until one has 
errors in the correlation functions only in terms of correlation functions or 
similarly for the power spectrum, integrated against an integral of a combina¬ 
tion of regular or spherical Bessel functions. This integral of (spherical) Bessel 
functions will not depend on the power spectrum or correlation function and 
can be calculated separately. For three Bessel functions and three and four 
spherical Bessel functions the exact value of this integral is simple and can be 
used directly, for four Bessel functions the result involves an elliptic integral @ 

This rewriting is only valid when the power spectrum or correlation function(s) 
is/are well behaved enough that the order of integration can be changed. In 
these cases it can provide an immense simplification of the calculation. The 
integral for three (spherical) Bessel functions in two (three) dimensions is ex¬ 
plicitly cut off, and the functions involved are not as oscillatory as Bessel 
functions in all cases where a simple rewriting was found, providing more 
numerical stability for calculating the errors. Unless one uses the elliptic inte¬ 
gral expression for the four Bessel functions, this rewriting doesn’t completely 
eliminate the Bessel functions needed to calculate the error matrix for (even 
a well behaved) w(9). The substitution and integrals are done below, first for 
the two dimensional correlation function and power spectrum and then their 
counterparts in three dimensions. 

For the two dimensional correlation function, the error matrix for w is given 
by equation [19j If the rise of w(9) is shallow enough at the origin, one can 
rewrite the term with just one P 2 (K) via 

m^J 0 (K6)J 0 (K6')P 2 (K) 

= f 9"d9"w{9") f KdKJ 0 (K9)J 0 (K9')J 0 (K9") (34) 

= f9"d9"w(9")f(9,9',9") 


where f(9,9',9") is known in closed form (Jackson & Maximon (J1972)), i.e. 


f(0,9',9") 


A {0,9', 9") 

2? tA(9, 9', 9") ' 


(35) 


9 The author will provide the lengthy expression found, which is still partially in 
i ntegr al form, upon request. The elliptic integral expression is given in e.g. Watson 
( 1996h . page 414 or Van Deun & Cools ( 20061 ). page 594. 


11 












Here 


A(9, 6', 9") 
A(9,9',9") 


1 if 9 + B' > d", e + 9" > 6', 6' + 9" >9 
I if 9 + 9' = 9" or 9 + 9" = 9' or 9' + 9" = 9 


0 otherwise 

\J{9 + 9' + 9"){-B + 9' + 9")(9 -9’ + 9")(9 + 9' - 9") 


(36) 


Geometrically, A is the area of the triangle formed with 3 lines of length 
9, 9' , 0"; A is 1 if the triangle is non-degenerate, 1/2 if it is degenerate and 
zero if no triangle is formed. In particular, A will cut off the integral over 9" 
for 9” > 9 + 9'. For any two sides equalling the third / diverges, but in the 
integral it is an integrable divergence; integrating by parts gives a finite result. 


Unfortunately a simple expression (without elliptic integrals) for 


j elk kJ 0 (M) Jo(ke') J o (k6")J o (k0'") 


(37) 


was not found, so there is still some explicit P 2 (ib) dependence in (ww). 
In terms of f(9, 9', 9")\ 


( w(9)w(9')) — {w(9))(w(9'))) 


= h f ^Jo(K 9)J 0 (K9')P2(K) + ^ f d9”9"w(9")f(9,9 ', 9") 

+n^[2(i + ™(»))US £1 + Me) s -gp + 

— U ) 

(38) 


For the angular power spectrum, there is no quadratic term in w so if P-iiK) 
is not too singular at the origin, its error matrix can be rewritten completely 
in terms of Po(K) itself: 


(P 2 (K)P 2 (K')) - (P 2 (K))(P 2 (K')) = f J^5 D (K- K’)(P 2 (K) + ^) 2 

+^(2P 2 (iF) + 2P 2 {K’)) + j^J K" dK"f{K , K\ K")P 2 {K") 

i i 

m 

where f(K,K',K'') is the same function as in equation 1351 with the replace¬ 
ment 9 —> K etc. Smoothing is done straightforwardly. 

For three dimensions all the integrals over spherical Bessel functions can be 
done exactly, and the applicability of this substitution is limited only by the 
requirement of sufficient falloff of the power spectrum and the correlation 
function at the origin. In terms of these integrals the error matrices from 
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section three become 


(£(r)£(r')) - (£(r))(£(r')) = f f r" 2 dr"r"' 2 dr , "$,(r")$,(r"')g 3 (r, r', r", r" 


8 

fr 2 

VA/V 

1 1 

1 2 A 

1 VM 2 

L 47r r 2 

i 1 

^5. 


47r r : 


(40) 


for the correlation function and 


((P(K) - P(K))(P(K') - P(K'))) = $y(P(K) + jj)H D (K - K') 

+jfe(P(K) + P(K') + | /„“ AK"K"ip(K«)h(K , K', it")) 

l 1 

(41) 

for the power spectrum. 

Here 

/ 3 (r r' r") = / 0 °° }z 2 dk sin ^ kr ^ sin ( fcr '^ sin ( fcr ") 

= |(—sgn(r — r' — r") + sgn(r + r' — r") + sgn(r — r' + r") — sgn(r + r' + r")) 

= |(—sgn(r — r' — r") + sgn(r + r' — r") + sgn(r — r' + r") — 1) 


„ ( r r t r " r M\ _ roo u‘2 rJ U sin(fcr) sin(fcr') sin(fcr") sin(fcr"') 
i/3v j ' j ' j ' ) Jo ™ k r hr' kr" hr'" 

- ^ | y y^ y^ i yM | | y j y^ y^ y^? | | y y^ j y^ yM | j | y j y? j y?? y^ 

_ | y y^ y ,f j y^ | j | y j y? y^ j y^^ | j | y y? j y^ j y^^ | ^y j y? j y^ j \ 

(42) 


As mentioned above, these rewritings have many advantages besides getting 
rid of the instabilities due to oscillations of the (spherical) Bessel functions. 
The functions / and / 3 cut off the integrals for large argument: i.e. in two 
dimensions f(6,6',6") is zero when 9" > 6 + 6' and in three dimensions, e.g., 
for K = K' in the power spectrum, the term with / 3 becomes 

- [°° dK"K” 2 P(K") — (1 + sgn(2K - K")) = - dK"K" 2 P(K") (43) 

n Jo 8 2 Jo 

with an analogue that can be seen by inspection for K ^ K'. These cutoffs 
reduce the dependence on the asymptotic values of the power spectrum or the 
correlation function. The function y 3 (r, r', r”, r 1 ") also goes to zero as r" or r'" 
goes to infinity, but as both parameters are varying some care must be taken 
in the joint limits. 
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4 Examples 


In this section the full Poisson error matrices given above are calculated for 
a power law power spectrum in two and three dimensions and compared to 
the usual Gaussian error matrix. In addition, the effects on the angular power 
spectrum errors for SZ selected galaxy clusters are shown. For these exam¬ 
ples the corrections are significant except for the two dimensional correlation 
function. The signal is taken to be Gaussian; if not, the three and four point 
functions mentioned earlier would need to be included. 

Starting with two dimensions, consider a power law power spectrum 

P 2 (K) = A k k n (44) 

with corresponding correlation function w{6) = Ag6 ~ n ~ 2 . For the error matrix 
to converge, — 1 < n < —1/2, take n = —0.9 for illustration, A k = 5.0 x 10~ 4 
and thus Ag = 7.0 x 10~ 5 . This is a rough fit to the power spectrum for SZ 
selected clusters for Y rnin = 1.7x 10 -5 such as might be seen with APEX0. We 
vary the number density of objects per steradian, J\f and the binnings A 6 and 
AK. The objects are taken to have a Gaussian distribution, and f sky = 1/47T, 
i.e. one steradian. The terms in the error matrix all increase by a multiplicative 
factor 1/fsky for other values of f sky . 

The true cluster case differs from this in three ways: P 2 (K) and w{6 ) are not 
true power laws, the number density is fixed (for clusters with the above Y min , 
A f ~ 33000 is expected) and the cluster distribution is not expected to be 
Gaussian to arbitrarily small scales. The cluster powe r sp ectrum and number 
densities for an experiment such as APEX or Planck 11 1 are shown later on, 
however the signal is still taken to be Gaussian and f sky remains 1/47T. 

For terminology, error means the square root of the diagonal (i.e. K = K' 
for power) term in the error matrix. The “shot noise” error means the term 
n^ 1 / 2 in the error of w(Q) and the term proportional to br the error of 

P 2 (K). The “Gaussian error” refers to the terms which arise when the shot 
noise is considered to be Gaussian rather than Poisson, i.e. the first line on 
the right in equations [23] and [24] and their three dimensional generalizations. 
The “non-Gaussian” error or “Poisson” error is everything else in the error. 
Similarly, “non-Gaussian” and “Poisson” refer to the analogous terms in the 
rest of the error matrix. 

Figure one shows the effects for the correlation function. The delta function 
errors which only appear at the origin have not been included. At top left is 

10 http: //bolo.berkeley.edu/apexsz/index.html 

11 http://astro. estec.esa.nl/SA-general/Projects/Planck 
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the angular correlation function for J\f = 35000 with shot, Gaussian and full 
error bars. The smoothing is 5 arcmin, i.e. the spacing between the error bars. 
The curved line is the correlation function for the analogous SZ cluster sample 
with Y min = 1.7 x 10~ 5 . Figure 1 upper right and lower left show the fraction 
of shot noise alone to the total error and Gaussian error to total error as a 
function of changing J\f respectively. As can be seen, although the shot noise 
is a significant source of the error, using only the shot noise underestimates 
the error by some noticeable fraction unless the shot noise is extremely large 
(A f = 5000 here). The non-Gaussian terms in the error are an equally small 
proportion of the total error for all three values of J\f considered here. The 
dominance of the shot noise in the error means that the cross correlation 
decreases as the shot noise increases, as can be seen in the lower right hand 
panel for the three different J\f values and also the case with binning of 15 
arcmin rather than 5 arcmin. As mentioned earlier, increased binning reduces 
the relative contribution of the shot noise error to the total error and increases 
the cross correlation for a fixed number density. 

For the angular power spectrum, the corresponding quantities are shown in 
figure two. Here smoothing is in equal intervals in log K. The curved line is the 
expected angular power spectrum for the SZ selected cluster survey mentioned 
above. The non-Gaussian contribution to the error matrix is very important! 
At upper left are the shot only, Gaussian and full errors for A f = 35000, the 
curved line is the reference SZ selected galaxy cluster angular power spectrum 
mentioned earlier. Upper right and lower left are the ratios of the shot noise 
only and Gaussian errors only to the total error as a function of K, and lower 
right is the cross correlation. The cross correlation is large and is entirely 
induced by the Poisson nature of the shot noise given our assumptions that r] 
and ( are zero. The effect of the binning, as K > 1, is to make the Gaussian 
contribution even a smaller fraction of the total error matrix. Binning in equal 
size bins in K rather than log K increases the Gaussian contribution to the 
error at larger K (as the bins are smaller at large K in this case), however 
the Poisson nature of the shot noise still makes the ratio of the Gaussian to 
full error small. The case for J\f = 35000 and 20 equal bins in K rather than 
log K is shown to illustrate this. The change induced by using the Poisson 
rather than Gaussian error matrix is large, for example, for K = 2000 and 
A f = 35000 the error bar increases by close to a factor of two, and the cross 
correlation with K = 469 is ~ 0.5 rather than zero. 


The case of SZ selected galaxy clusters with specific modeling and cosmolog¬ 
ical assumptions can be done analogously: i.e. with the Evrard mass func¬ 
tion (Evrard et al ( 20021 )). She th-To rmen bias (Sheth & Tormen ( 1999 )). 
Eisenstein-Hu (Eisenstein & Hu ( 19971) ) transfer function, T* z = 1.2 keV and 
<j 8 = 0.9, il rn = 0.3. Details of the calculation to get this curve and assump¬ 
tions can be found in Cohn & Kadota (120051 ). The error matrix for Y m i n = 
1.7 x 10 -5 is extremely close to those for the J\f = 35000 example in figure one. 
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Figure three shows the fraction of total errors given by the Gaussian approxi¬ 
mation, and the cross correlation for Y min = 1.7 x ICC 5 , 3.4 x 10 -5 ,1.0 x 1CT 4 
(with J\f determined by Y min and the other parameters mentioned before). 
A Y min = 3.4 x 10- 5 might arise if the survey was wider and shallower, and 
Y m in = 10 -4 is a rough estimate of the errors for Planck (even smaller effective 


Ymin might arise once cluster hireling is applied (Geisbusch et al (120041 ). for 
example). The above is for the power spectrum. For the correlation function, 
the Poisson corrections are small, similar to the power law example^ For in¬ 
stance, in estimates pertaining to Planck (e.g. Mei & Bartlett (2003])) with 
Ymin — 1-0 x 10 -4 , the shot noise only approximation to the error is quite 
good, within the 5% of the full Poisson error for 9 > 30'. 

In three dimensions the value of K for clusters is smaller and £(r) is larger, so 
the Gaussian terms become more dominant for the power spectrum and less 
so for the correlation function. An unbinned power law case is shown here just 
to illustrate the (large) effects of the full Poisson treatment. Consider a rough 
power law approximation to the correlation function for galaxy clusters (e.g. 
Bahcall et al ( 2004 1. for M > 1 x lO 14 /r _1 M 0 ), i.e. take £(r) = 100r -1,9 so that 
P{k) = 1.9 x 10 3 k~ 11 (h~ 1 Mpc) 3 . In figure four, the ratio of the Gaussian to 
total errors and the cross correlation are shown for the correlation function and 
the power spectrum, with J\f = ICC 3 ,10~ 4 ,ICC 5 . For the correlation function 
the Gaussian errors differ from the Poisson errors by a factor of £(r)/n p and 
are a less accurate approximation for the correlation function error than in 
two dimensions. One expects of order ICC 4 — ICC 5 clusters, with mass above 
lO 14 /r _1 M 0 per cubic h~ l Mpc\ if the mass cut is higher the density is lower 
and the shot noise effects and the correlation function/power spectrum are all 
larger. The Poisson effects on the error matrix appear to be large unless the 
shot noise is very small. 


For both two and three dimensions, an estimate of when non-Gaussian effects 
in the (angular) power spectrum itself become important is often given by 
considering A \{k) = — I 0 2 J } K ' > and A 2 (K) = — ,/_l A ^ in the range ~ 0.1 — 1. 
For the power law examples considered this is a large range, 650 < K < 5300 
for two dimensions and 0.03 < K < 0.09 for three dimensions, i.e. there may 
be other modifications at the scales where the shot noise non-Gaussianity 
becomes relevant. 


5 Conclusions 


Poisson shot noise can produce an error matrix for the correlation function 
and power spectrum significantly different from that calculated in the Gaus¬ 
sian shot noise approximation. The primary contribution to the correlation 
function error matrix is diagonal and goes as w(9)/n p in two dimensions and 
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as £{r)/n p in three dimensions. For the power spectrum, there are additions 
to both diagonal and off-diagonal terms in the error matrix, and thus cor¬ 
relations are introduced in the power spectrum, even for a Gaussian density 
distribution. For SZ selected galaxy clusters, the errors in the two dimensional 
correlation function are well approximated by the Gaussian error except for 6 
extremely small. For the power spectrum, the cross correlations and additional 
errors are significant. Binning increases this effect as it reduces the Gaussian 
contribution relative to the Poisson contributions for the power spectrum. This 
increase in the error matrix due to the Poisson nature of shot noise should be 
taken into consideration both for parameter estimation and in survey design 
(e.g. a shallow survey increases the shot noise and thus these additions to the 
error and correlations for the power spectrum). 


There are two important caveats: The cluster density itself is non-Gaussian at 
small enough scales and thus at small enough distances these two contributions 
to the error matrix should be combined. It should be noted that due to non- 
Gaussianity, the error matrix alone will also be insufficient to calculate the full 
likclihood|_!£j The Poisson shot noise description may also break down in high 
density regimes (Casas-Miranda et al f 2002h ). It would be interesting to see 
the effects of the other non-Gaussian shot noise distributions which appeared 
in their analysis (sub- and super-Poisson). 


The analytic calculations here, just as in the Gaussian shot noise case, are of 
the most utility in estimating the power of future observations and guiding 
the corresponding observational strategies. When the eagerly awaited data is 
in hand, mock catalogues and various statistical strategies such as bootstrap, 
jack-knife and Monte Carlo (whichever is most appropriate for the question 
of interest, see e.g. Lupton fll993n for an introduction) will likely be needed 
to include the effects of both the complications discussed in this paper and 
additional observational aspects such as window functions. 


An additional point in this paper is that there is a rewriting of integrals used 
in the error matrix which eliminates most of the integrals over Bessel and 
spherical Bessel functions. This reduces much of the dependence of the error 
matrix on the tails of the power spectrum and correlation functions and also 
makes the error matrix integrals more tractable. 
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12 It is an interesting question when the error matrix is sufficient for calculating the 
full likelihood and how that compares to when the non-Gaussianity of the shot noise 
becomes important. I thank the referee for raising this issue. 
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errors in some of the integration measures. 
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Fig. 1. Upper left: Shot noise, Gaussian and full errors for w(9), with J\f = 35000, 
5 arcmin smoothing, and f s k y = 1/Att. The curved line is the angular power spec¬ 
trum for SZ selected galaxy clusters with Y mm = 1.7 x 10 -5 as mentioned in the 
text. Upper right: shot noise as fraction of total correlation function error for 5 
arcmin smoothing and N = 5000, 35000 and 65000 objects per steradian, and 
then for 15 arcmin smoothing and A f = 35000, top to bottom. Lower left: the ra¬ 
tio of using only errors calculated with the Gaussian approximation for Poisson 
noise, divided by the full errors, for all four cases. Lower right: The cross cor¬ 
relation, (w(9)w(30')) / yj(w (30' )w (30'))/ (w(O)w(O)) as a function of angle for the 
angular correlation function for 5 arcmin smoothing and three different cases of ob¬ 
ject numbers (5000,35000,65000 from bottom to top) and for 15 arcmin smoothing 
and TV = 35000 (very top). 
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Fig. 2. Upper left: angular power spectrum with full errors, Gaussian errors, and 
only shot noise errors, largest to smallest error bars respectively. Error bars are 
slightly offset. Upper right: shot noise only as fraction of total error for 20 K bins 
smoothed in log K and J\f = 5000, 35000 and 65000, and for 10 k bins and 20 equal 
K rather than log K bins with A/" = 35000. Lower left: Gaussian errors divided 
by the full errors, for all five cases. The lowest line (at far right of each figure) is 
for A f = 5000, the second lowest is that for using 10 bins, and then the next two 
going up are J\f = 35000,65000. The highest (dotted) line is for 20 equal bins in 
K rather than log K. Lower right: the cross correlation for the power spectrum, 
{P 2 {m)P 2 {K))/^/{P 2 (m)P 2 (m))/(P 2 {K)P 2 (K)) as a function of K for 20 log 
K bins and three different choices of object numbers (5000,35000,65000 from top 
to bottom). 
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Fig. 3. Top: the ratio between the Gaussian and full error for three different choices 
of Y m i n , 1.7 x 10 -5 ,3.4 x 1(T 5 ,1.0 x 10 -4 ; top to bottom. Bottom: the cross corre¬ 
lation for the power spectrum, (P- 2 (469)^2(A"))/ yj(P 2 (469)^2(469))/ (P 2 (K)P 2 (K)) 
for three different choices of F m j„, 1.7 x 10 —5 , 3.4 x 10 -5 ,1.0 x 10 —4 , bottom to top. 
In both figures there is also a line for for the power law example with Af = 35000; 
it is almost completely degenerate with that for Y m i n = 1.7 x 10“ 5 . 
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Fig. 4. Upper left: the ratio of the Gaussian error to the total error for 
the three dimensional correlation function. Upper right: the cross correlation 
(£M£(30))/v/(£(r)£(r))(£(30)£(30)) for the three dimensional power law ex¬ 
ample. For both, Af = 1CU 3 ,10 4 , 10 -5 top to bottom. Lower left: the ra¬ 
tio of the Gaussian error to the total error and right, the cross correlation 
(P(k)P( 0.1))/y/ (P(k)P(k))(P(0.l)P(0.l)) for the three dimensional power law ex¬ 
ample. Af = 10 -3 ,10 -4 ,10 -5 top to bottom left, and bottom to top right. No binning 
is assumed, so the peak in the cross correlation is very sharp. 
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